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Abstract. The smallest positive eigenvalue of the Laplacian of a net- 
work is called the spectral gap and characterizes various dynamics on 
networks. We propose mathematical programming methods to maximize 
the spectral gap of a given network by removing a fixed number of nodes. 
We formulate relaxed versions of the original problem using semidefinite 
programming and apply them to example networks. 
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1 Introduction 

An undirected and unweighted network (i.e., graph) on N nodes is equivalent 
to an N x N symmetric adjacency matrix A = (Ay), where Ay = 1 when 
nodes (also called vertices) i and j form a link (also called edge), and Ay = 
otherwise. We define the Laplacian matrix of the network by 

L = D-A, (1) 

where D is the N x N diagonal matrix in which the zth diagonal element is equal 
to Ylj=x Aij, i.e., the degree of node i. 

When the network is connected, the eigenvalues of L satisfy 

Ai = < A 2 < ■•• < X N . (2) 

The eigenvalue A2 is called spectral gap or algebraic connectivity and charac- 
terizes various dynamics on networks including synchronizability [51I31HD]. speed 
of synchronization [3] , consensus dynamics [17] , the speed of convergence of the 
Markov chain to the stationary density [HUE] , and the first-passage time of the 
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random walk [W. Because a large A2 is often considered to be desirable, e.g., for 
strong synchrony and high speed of convergence, maximization of A2 by changing 
networks under certain constraints is important in applications. 

In the present work, we consider the problem of maximizing the spectral gap 
by removing a specified number, Add, of nodes from a given network. We assume 
that an appropriate choice of N$ e i nodes keeps the network connected. A heuris- 
tic algorithm for this task in which nodes are sequentially removed is proposed 
in [5D] . In this study, we explore a mathematical programming approach. We pro- 
pose two algorithms using semidefinite programming and numerically compare 
their performance with that of the sequential algorithm proposed in |20) . 

2 Methods 

We start by introducing notations. First, the binary variable X{ (1 < i < N) 
takes a value of if node i is one of the iVdei removed nodes and 1 if node i 
survives the removal. Our goal is to determine Xi (1 < i < N) that maximizes 
A2 under the constraint 

N 

^Xi = N- N dch (3) 

i=l 

Second, we define as the N x N Laplacian matrix generated by a single link 
£ E, where E is the set of links. In other words, the and elements 
of Lij are equal to 1, the and elements of Lij are equal to —1, and all 
the other elements of Lij are equal to 0. It should be noted that 

L= ]T Uj. (4) 

l<i<j<N;(i,j)eE 

Third, J denotes the N x N matrix in which all the N 2 elements are equal to 
unity. Fourth, Ei denotes the N x N diagonal matrix in which the (i, i) element 
is equal to unity and all the other A^ 2 — 1 elements are equal to 0. 

After the removal of Adei nodes, we do not decrease the size of the Laplacian. 
Instead, we remove L^ from the summation on the RHS of Eq. @ if node i or 
j has been removed from the network. The Laplacian of the remaining network, 
if connected, has iVdei + 1 zero eigenvalues. The corresponding zero eigenvectors 
are given by = (1 • • • 1) T and e^, where T denotes the transposition, is 
the unit column vector in which the ith element is equal to 1 and the other N — 1 
elements are equal to 0, and i is the index of one of the N^ei removed nodes. 

We formulate a nonlinear eigenvalue optimization problem, which we call 
EIGEN, as follows: 

maximize t subject to Eq. ([3]) and 

N 

-tl+ ^2 XiXjLij + aJ + /3^(1 - Xi)Ei ^ 0, (5) 

i<j;(i,i)EE '=1 
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and Xi 6 {0, 1} (1 < i < N), where >: indicates that the LHS is a posi- 
tive semidefinite matrix. The positive semidefinite constraint Eq. ([5]) is derived 
from a standard prescription in semidefinite programming for optimization of 
an extreme eigenvalue of a matrix. Maximizing t is equivalent to maximizing 
the smallest eigenvalue of the matrix given by the sum of the second, third, and 
fourth terms on the LHS of Eq. ([5]). 

Without the third and fourth terms on the LHS of Eq. ([5]), the optimal solu- 
tion would be trivially equal to t = because the Laplacian of any network has 
as the smallest eigenvalue. Because J = u^u^ T , the third term transports 
a zero eigenvalue to ss a. We should take a sufficiently large a > such that the 
zero eigenvalue is shifted to a value larger than the spectral gap of the remain- 
ing network, denoted by A2 . This technique was introduced in [§] for solving the 
traveling salesman problem. 

For each removed node i (i.e., Xi =0), the matrix represented by the second 
term on the LHS of Eq. ([5]) has a zero eigenvalue associated with eigenvector e^. 
The fourth term shifts this zero eigenvalue to « (3. Note that the fourth term 
disappears for the remaining N — iVdoi nodes because Xi — 1 for the remaining 
nodes. If the shifted eigenvalues are larger than A2, the solution to the problem 
stated above returns the N^ei nodes whose removal maximizes A2. 

The second term on the LHS of Eq. ([5]) represents a nonlinear constraint. 
To linearize the problem in terms of the variables, we follow a conventional 
prescription to introduce auxiliary variables 

X^j — X^Xj, (6) 

where 1 < i < j < N (also reviewed in [H]). If Xi is discrete, Xi(l — x{) = 

holds true. Therefore, we require Xu = x\ = Xi. In the following discussion, 
we use Xi in place of Xu. 

We define the (N + 1) x (N + 1) matrix 



Y = 



1 x> 
x X 



(7) 



where x = (x\ . . . x^) T , the element of the N x N matrix X is equal to 
Xi, and the element (i =/= j) of X is equal to Xy. By allowing Xi and X^ 
(1 < i < j < N) to take any continuous value between and 1, we define the 
relaxed problem named SDP1 as follows: 

maximize t subject to Eq. ([3]) and 

N 

-tl + XijLij+aJ + 0^(1 - Xi)Ek h 0, (8) 

i<j;(i,j)eE i=l 

Y t0. (9) 

Note that Eq. © implies < x, t < 1 (1 < i < N) and that SDP1 relaxes 
the original problem in that Xi and X^ are allowed to take continuous values 
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while Eq. ^ is imposed. The method that we propose here for approximately 
maximizing the spectral gap is to remove the A^el nodes corresponding to the 
iVdei smallest values among xi, . . ., xm in the optimal solution of SDP1. 

SDP1 involves N(N + l)/2 + 1 variables (i.e., t, Xi, and Xij with i < j). 
In fact, for £ E is free unless Eq. © is violated; it does not appear 

in the main positive scmidefinite constraint represented by Eq. ([8]). Because a 
given network is typically sparse, this implies that there are many redundant 
variables in SDP1. To exploit the sparsity and thus to save time and memory 
space, a technique based on matrix completion might be useful [ITJ[l6]. In this 
paper, however, we propose another relaxation SDP2 for this purpose. 

To linearize the second term on the LHS of Eq. ([5]), we take advantage of 
four inequalities XiXj > 0, Xi(l — Xj) > 0, (1 — Xi)Xj > 0, and (1 — Xi)(l — Xj) > 
that must be satisfied for any link £ E. By defining = X^Xj j cLS 111 the 

case of SDP1, we obtain the following four linear constraints [18] : 

Xij >0, (10) 

Xl - Xij >0, (11) 

Xj - X^ >0, (12) 

1-Xi-Xj + X^ >0. (13) 

SDP2 is defined by replacing Eq. ^ by Eqs. (fTUjl - lfll?)) . where only the pairs 
6 E arc considered. Note that Eqs. (|TU)) -(fT3" |) guarantee < Xi < 1 (1 < 
i < N). We remove the iVdci nodes corresponding to the TVdei smallest values 
among x\, . . ., xjv in the optimal solution of SDP2. 

Numerically, SDP2 is much easier to solve than SDP1 for two reasons. First, 
the number of variables is smaller in SDP2 than in SDP1. In SDP2, X^ is 
defined only on the links, whereas in SDP1 it is defined for all the pairs 1 < 
i < j < N. In sparse networks, the number of variables is 0(N 2 ) for SDP1 
and O(N) for SDP2. Second, the positive semidefinite constraint, which is much 
more time consuming to solve than a linear constraint of a comparable size, is 
smaller in SDP2 than in SDP1. While SDP1 and SDP2 share the N x N positive 
semidefinite constraint ©, SDP1 involves an additional positive semidefinite 
constraint ^ of size (N + 1) x (N + 1). 

To determine the values of a and /3, we consider the matrix represented 
by the sum of the second, third, and fourth terms on the LHS of Eq. ([5]). A 
straightforward calculation shows that the eigenvalues of this matrix are given 
by the N— N<ic\— 1 positive eigenvalues of the Laplacian of the remaining network, 



(iV de i - l)-fold P, and (3 + aN - p ± ^/(aN - P) 2 + 4N dcl aP /2. For a fixed 



P, we should select a to maximize P + aN — P — (aN — P) 2 + ANd \ap /2 
which is always smaller than eigenvalue p. We set 

P 

a = — 

N 



(14) 



to simplify the expression of this eigenvalue to P(l — \/Nd c i/N) while approxi- 
mately maximizing this eigenvalue. 
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We have the following bounds for the optimal solution to the original prob- 
lem. We denote by the optimal solution, i.e., the maximum spectral gap 
with A*doi nodes removed. We denote by A§ DP the smallest positive eigenvalue 
of the network obtained by the proposed method; the proposed method removes 
the TVdei nodes corresponding to the Add smallest values of Xi, . . ., xn in the 
optimal solution of SDP1 or SDP2. Obviously, A| DP is a lower bound for An P . 
On the other hand, the optimal value, maxt, of SDP1 or SDP2 serves as an 
upper bound for , as long as the (3 satisfies A^' < /3(1 — ^/Add/A). This 
follows from the facts that the optimal value of EIGEN with such a j3 value 
coincides with A°; pt and both SDP1 and SDP2 are a relaxation of EIGEN. We 
can summarize our observation as follows: A§ DP ^ ^2 P * — m axi- 

3 Numerical results 

In this section, we apply SDP1 and SDP2 to some synthetic and real networks. 
We implement SDP1 and SDP2 using the free software package SeDuMi 1.3 that 
runs on MATLAB 7.7.0.471 (R2008b) [T]. 

We compare the performance of SDP1 and SDP2 with that of the optimal 
sequential method, which is a heuristic method proposed in [5D] . In the optimal 
sequential method, we numerically calculate the spectral gap for the network 
obtained by the removal of one node; we do this for all possible choices of a 
node to be removed. Subsequently, we remove the node whose removal yields 
the largest spectral gap. Then, for the remaining network composed of A — 1 
nodes, we determine the second node to be removed in the same way. We repeat 
this procedure until Add nodes have been removed. 

The first example network is the well-known karate club social network, in 
which a node represents a member of the club and a link represents casual 
interaction between two members [3T]. The network has A — 34 nodes and 78 
links. We set (3 — 2. The spectral gaps obtained by the different node removal 
methods are shown in Fig. HJa) as a function of Add- Up to Add = 5, the 
optimal sequential method yields the exact solution, as do SDP1 and SDP2. 
For Ad e i > 6, we could not obtain the exact solution by the exhaustive search 
because of the combinatorial explosion. For 7 < Am < 16, SDP1 and SDP2 
perform worse than the optimal sequential method. However, for Add > 17, 
both SDP1 and SDP2 outperform the optimal sequential method. SDP1 and 
SDP2 found efficient combinations of removed nodes that the optimal sequential 
method could not find. 

Second, we test the three methods against the largest connected component 
of the undirected and unweighted version of a macaque cortical network [19] . 
The network has A = 71 nodes and 438 links. We set /3 — 2. The spectral 
gaps obtained by the different methods are shown in Fig. |Tfb) . Up to Add = 
4, the optimal sequential method yields the exact solution, as do SDP1 and 
SDP2. For Aid > 5, we could not obtain the exact solution because of the 
combinatorial explosion. For Add > 5, SDP1 and SDP2 perform worse than 
the optimal sequential method. Consistent with the poor performance of SDP1 
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and SDP2, the final values of xi (1 < i < N) are not bimodally distributed 
around and 1 as SDP1 and SDP2 implicitly suppose. The distribution is rather 
unimodal except for the first three values of Xj that are close to 0. The ten 
values of Xi when A^ei = 5, in ascending order, are as follows: £33 = 0.1086, 
2 62 = 0.1531, x 53 = 0.1589, xi = 0.4813, x 2 = 0.5246, x 8 = 0.5591, x 7 = 0.6449, 
x 24 = 0.78 66, x 51 = 0.8 7 49, and x 63 = 0.8931 in SDP1, and x 53 = 0.000, 
x 33 = 0.145, x 62 = 0.177, x 2 = 0.585, x x = 0.588, x 8 = 0.6 1 0, x 7 = 0.668, 
x 2 4 = 0.708, x 5 = 0.738, and x± = 0.937 in SDP2. 




Fig. 1. Spectral gap as a function of the number of removed nodes for four networks, 
(a) Karate club social network with N = 34 nodes, (b) Macaque cortical network 
with N = 71 nodes, (c) Barabasi-Albert scale-free network with iV = 150 nodes, (d) 
C. elegans neural network with TV = 279 nodes. 



The third network is a network with N = 150 nodes generated by the 
Barabasi-Albert scale-free network model [S]. The growth of the network starts 
with a connected pair of nodes, and each incoming node is assumed to have two 
links. The generated network has 297 links. We set (3 = 2. For this and the next 
networks, SDP1 cannot be applied because N is too large. Therefore, we only 
compare the performance of SDP2 against the optimal sequential method. The 
results shown in Fig. [3c) indicate that SDP2 outperforms the optimal sequential 
method when JVdci > 7. 
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The fourth network is the largest connected component of the C. elegans 
neural network [21 E]. Two nodes are regarded as being connected when they 
are connected by a chemical synapse or gap junction. We ignore the direction 
and weight of links. The network has N — 279 nodes and 2287 links. We set 
f3 = 2.5. The results for SDP2 and the optimal sequential method are shown 
in Fig. [IJd). Although the spectral gap gradually increases with iVdci for SDP2, 
SDP2 performs poorly as compared to the optimal sequential method for this 
example. 



4 Discussion 

We proposed a method to maximize the spectral gap using semidefinite program- 
ming. The two proposed algorithms have a firmer mathematical foundation as 
compared to the heuristic numerical method (i.e., optimal sequential method). 
The proposed algorithms performed better than the heuristic method for two 
networks especially for large iVdei and worse for the other two networks. For the 
former two networks, we could find the solutions in the situations in which the 
exhasutive search is computationally formidable. Up to our numerical efforts, 
our algorithms seem to be efficient for sparse networks. 

We should be careful about the choice of j3. If (3 is too large, SDP1 and SDP2 
would result in xi ~ N<i e \/N (1 < i < N). This is because setting Xi — N^ B \/N 
(1 < i < N) makes the fourth term on the LHS of Eq. © equal to f3 N ~^ d ° l I, 
which increases all the eigenvalues, including the spectral gap of the remaining 
network, by (3 N ~^ d ° l ■ In contrast, if /3 is smaller than A 2 , SDP1 and SDP2 
would maximize a false eigenvalue originating from the fourth term on the LHS 
of Eq. Q. 

To enhance the performance of SDP1 and SDP2, it may be useful to abandon 
the convexity of the problem. For example, we could try replacing (1 — Xi) in the 
fourth term by (1 — Xi) p and gradually increase p from unity. When p > 1, the 
problem is no longer convex. Accordingly, the existence of the unique solution 
and the convergence of a proposed algorithm are not guaranteed. Nevertheless, 
we may be able to track the optimal solution x by the Newton method while 
we gradually increase p (see p. 5 and p. 63 in [6). An alternative extension is to 
add — f>5Zi=i ~ x i) t° the objective function to be maximized (i.e., t). When 
p > 0, the convexity is violated. However, we may be able to adopt a procedure 
similar to the method explained above, i.e., start with p = and gradually 
increase p to track the solution by the Newton method. 
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